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Following recent work by Marathe and Parrondo [PRL, 104, 245704 (2010)], we construct a 
classical Hamiltonian system whose energy is reduced during the adiabatic cycling of external 
parameters, when initial conditions are sampled microcanonically. Combining our system 
with a device that measures its energy, we propose a cyclic procedure during which energy is 
extracted from a heat bath and converted to work, in apparent violation of the second law of 
thermodynamics. This paradox is resolved by deriving an explicit relationship between the 
average work delivered during one cycle of operation, and the average information gained 
when measuring the system's energy. 
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I. INTRODUCTION 

The Kelvin-Planck statement of the second law of thermodynamics asserts that no process is 
possible whose sole result is the extraction of energy from a heat bath, and the conversion of that 
energy into work. |IJ Because this statement is formulated in terms of energy rather than entropy, 
it provides an attractive starting point for exploring the microscopic foundations of the second law. 
This is particularly true when we consider an immediate corollary of the Kelvin-Planck statement: 
when a thermally isolated system, initially in equilibrium, evolves under a cyclic variation of 
external parameters, its internal energy cannot decrease}]] Since an isolated system exchanges no 
heat with its surroundings, and is governed by familiar equations of motion - Hamiltonian dynamics 
in the classical case, or the Schrodinger equation for a non-relativistic quantum system - relatively 
few theoretical tools are needed to embark on an investigation of this statement. 

Let us formulate the problem as follows. A finite, classical system is described by a Hamiltonian 
H(z;X), where z = (q,p) denotes a point in 2Z)-dimensional phase space, and A = (Ai,--- , X n ) 
is a set of externally controlled parameters. At time t = the system's initial conditions are 
sampled from an equilibrium distribution p eq (z), and then for < t < r the system evolves under 
Hamilton's equations as the parameters are made to trace out a closed loop in A-space. We will 
use the notation A c (i) to denote such a cyclic protocol for varying the parameters, beginning and 
ending at X A = A c (0) = A c (r). The work performed on the system during this process is the net 
change in the value of the Hamiltonian, 

W = H(z T ;X A )-H(z ;X A ), (1) 

where the trajectory zj describes the system's evolution from t = to t = r. Since Hamiltonian 
dynamics are deterministic, the value of W is fully determined by the initial conditions: W = 
W{zq). The Kelvin-Planck statement, viewed as a statistical prediction about averages, then 
implies the inequality, 

(W) = J dz p oq (z )Ty(z ) >0. (2) 

We now ask, for what choices of the equilibrium distribution p eq (z) can this result be established 
rigorously? 

When initial conditions are sampled from a canonical distribution 

p£ n (z)ocexp[-/3i^(z)] , H A (z) = H(z;X A ), (3) 

If its energy were to decrease, then at the end of the process the system could be returned to its initial state by 
equilibrating it with a heat bath at temperature T, resulting in the net conversion of heat to work. 
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Eq. [2] follows directly from the properties of Hamilton's equations [2H3] ■ in fact, this result extends 
to any distribution of initial conditions that is a decreasing function of energy (3j H] . Somewhat 
surprisingly, however, Eq. [2] is not universally valid when initial conditions are sampled from a 
microcanonical distribution, 

p^ n ( Z )oc<^-iJ A (z)] (4) 

This has been discussed by Allahverdyan and Nieuwenhuizen [3], but to the best of our knowledge 
it was Sato [5] who first constructed a counter-example, involving a perturbed, one-dimensional 
harmonic oscillator. For microcanonically sampled initial conditions, Sato described a cyclic vari- 
ation of the Hamiltonian that results in a negative value of average work, (W) < 0. More recently, 
Marathe and Parrondo [BJ have developed another counterexample to Eq. [2j involving a particle 
inside a box with hard walls and an insertable barrier. For a given initial energy, Marathe and 
Parrondo describe a cyclic manipulation of the walls and the barrier, whose net effect is to reduce 
the energy of the system. Ultimately, the particle can be brought arbitrarily close to zero kinetic 
energy by a succession of such cycles, with a different protocol for each cycle. 

Inspired by Ref. [BJ, in the present paper we introduce and analyze another model system 
that violates Eq. [2] We consider a classical particle moving in a one-dimensional potential well, 
described by a pair of external parameters A = (Xl,^r) (see Eq. [H] and Fig. [lj. We will discuss 
the design of protocols for varying these parameters cyclically with time, A c (t), in a manner that 
lowers the energy of the system. In particular, for any choice of initial particle energy Ei, we will 
construct a protocol (which depends on the value of Ei) that reduces the particle's kinetic energy 
arbitrarily close to zero in a single cycle, bringing the system to a final state in which the particle 
sits nearly motionless at the bottom of the potential well. In effect, the system is cooled near to 
"absolute zero" temperature. 

Our model, like those of Refs. [HIE], suggests that a perpetual-motion device of the second kind 
could be constructed, operating by the following steps. 

1. The system is brought into contact and allowed to equilibrate with a thermal reservoir at 
temperature T. The reservoir is then removed. 

2. The energy of the now-isolated system is measured. 

3. The system is subjected to a cyclic protocol that reduces its kinetic energy close to zero (as 
discussed above). 
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By repeatedly performing this sequence of steps, we obtain a scenario in which energy is systemat- 
ically extracted from the reservoir (step 1) and delivered as work to the agent that carries out the 
cyclic protocol (step 3). This is reminiscent of Maxwell's demon [7H5], only here the demon's role is 
to implement a cyclic protocol \ c (t) based on the measured energy of the system, instead of opening 
or closing a trapdoor based on the observed motion of nearby particles. The key to exorcising the 
demon - that is, to reconciling this scenario with the second law of thermodynamics - is to recog- 
nize that the repeated measurements of energy in step 2 result in the accumulation of information. 
In order for the device to satisfy the "sole result" stipulation of the Kelvin-Planck statement (see 
above), this information must eventually be erased. As famously discussed by Landauer [TU], and 
by Bennett in the context of Szilard's engine [12] - another incarnation of Maxwell's demon 
- the erasure of information carries an unavoidable thermodynamic cost of fee? 1 In 2 per bit. We 
will show by explicit calculation that this cost ultimately wipes out any gains made by our device: 
in the process of erasing the accumulated information, all of the work harvested by the device is 
returned as heat to the thermal reservoir. 

In Sec. [n] we introduce our model and discuss protocols A c (t) that reduce the energy of the 



system. In Sec. Ill we discuss the average amount of work that is extracted per cycle, when 
carrying out the three-step procedure discussed above; this amount depends on the precision with 
which the initial energy is measured in step 2. Using Landauer 's principle for the work that must 
eventually be expended to erase the accumulated information (ksT 'In 2 per bit), we will show that 
this is no less than the work extracted in step 3, regardless of the precision with which the initial 
energy is measured. Thus in the final accounting, after all the bits of information are reset to zero, 
the device is unable to deliver work and the second law is rescued from the demon. 

II. MODEL AND PROTOCOLS 

Consider a classical particle of unit mass moving in one dimension, governed by a Hamiltonian 

A L q 2 if q < 



H(z; A) = ^ + U(q; A) = ^ + q 4 



(5) 

X R q 2 if Q > 



where z = (q,p) is a point in the phase space of the particle, and A = (Al,Aj?) is a point in 
two-dimensional parameter space, with \l,Xr > 0. The parameter A^ modulates the shape of 
the potential energy function in the region q < 0: when A^ > 0, there is a local minimum at 
^mm _ —y/\ L /2, as illustrated in Fig. [IJ Similarly, the value of Xr specifies a minimum at 
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coordinate, q 



FIG. 1. The solid curve depicts the potential U(q; 1.0, 0.7), with local minima at q™ m 
g™° = +\/0.35 (sec text). The dashed curve is the unperturbed, quartic potential U(q; 0,0). 



'0.5 and 



(A,A) 



(0,0) l L 

(a) Symmetric protocol. 



(AlAr) 



(0,0) l L 

(b) Asymmetric protocol. 



FIG. 2. The cyclic protocols A c (t), depicted here, proceed clockwise from the origin. 



9i? m = We wn l refer to these regions as the left well and the right well. When A = (0, 0), 

the particle moves in a quartic potential, which we call the unperturbed system. 

Now imagine a protocol A c (t) whereby the parameters are made to trace out the perimeter of the 



square shown in Fig. 2(a) , starting and ending at A = (0, 0). For simplicity we assume a constant 



speed, |dA/dt| = 4A/r. The deformation of the potential during this protocol can be pictured 
as follows. Starting from the unperturbed quartic potential, the right well gradually drops down, 
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U(q)r 




(a) t = 





-2-1012 

q 

(c) t = r/4 



U(q), 






-2-1012 

q 

(d) t = r/2 



(e) t = 3r/4 





q 



(f) t = T 



FIG. 3. Snapshots of the potential energy function as A is varied according to the protocol shown in Fig. 2(a) 



with A = 5.0 (hence E\ = 2.744 and E2 = 6.914, see Eq. [6|. The shaded regions illustrate the evolution of 
sets / and II, in the quasi-static limit r — > 00. 



forming a local minimum that moves from the origin to WA/2 (see Fig. 3(a) - 3(c) ) as \r increases 
from to A. Next, as increases from to A the left well drops down, forming a local minimum 



that comes to rest at — y/A/2, with a local maximum at the origin (Fig. 3(d)). These two stages 
are then undone (Figs. |3(e) 3(f)[ ). The net effect is a piston- like pumping of the right and left 
wells. For this protocol, let Zf denote a trajectory evolving under the time-dependent Hamiltonian 
H(z;X c (t)). 

For a given choice of A, let us define two energy values, 



Si (A) 



3/ ) 



4/3 



A 



E 2 (A) 



3/o J 



(6) 



where 



Jo 



d y yr 



7rr(5/4) 
T(7/4) 



1.74804. 



(7) 



These in turn define three regions of phase space, I, 77, and JiT, according to the value of the 
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unperturbed Hamiltonian Hq(z) = p /2 + q : 



I : < H {z) < E x 
II :E 1 < H (z) < E 2 
III : E 2 < iZo(z) 



(8) 



We now claim that when the protocol A c (t) shown in Fig. 2(a) is implemented quasi-statically, 
r — > oo, then the net effect is to swap regions / and II. That is, trajectories with initial conditions 
zq in region / end with final conditions z T in region II, and vice- versa. (See, however, the discussion 
of subtleties associated with this limit, in Sec. IV ) Fig. [3] and the following paragraphs convey how 
this swap proceeds. For convenience, we will use the terms set I and set II to refer to trajectories 
with initial conditions in regions I and II of phase space, respectively. The shaded regions in Fig. [3] 
depict the evolution of these sets of trajectories, as a sequence of snapshots from t = to t = r. 
By Hamilton's equations we have 



;H(z t ;X t 



dX dH 



( z t; A t ) 



-it 



X L 0(-qt) + X R O(+q t ) 



(9) 



dt K " dt d\ 

where #(•) is the unit step function. During the first stage of the process, < t < r/4, we have 
Xl = and Xr > 0, therefore as the right well drops down the value of H{zt, At) decreases whenever 
qt > 0. As a result, some trajectories acquire negative energies {H < 0) and become trapped in 
the right well. As shown in Fig. |3(c) - and as justified quantitatively by Eqs. 10 - 15 below - at 
the end of this stage the trajectories belonging to set I are trapped. 

During the second stage, r/4 < t < r/2, the left well drops down, trapping the trajectories in 
set iT. As this occurs, the trajectories in set / remain trapped in the right well. 

From r/2 < t < 3t/4:, as the right well rises and ultimately disappears, the trajectories in set 



/ gain energy (Fig. 3(e) ), and during the fourth and final stage, 3r/4 < t < r, all trajectories gain 



energy as the left well gradually rises until it disappears. The situation at t = r, shown in Fig. |3(f) 



reflects the swap that has occurred between sets / and II, relative to Fig. 3(a) 



Due to adiabatic averaging, the energy-ordering of the trajectories within each set remains fixed 

into a 



in the quasi-static limit: if we were to subdivide the lightly shaded region II in Fig. 3(a) 



stack of narrow horizontal bands, then the vertical ordering of these bands would remain unchanged 
throughout the process. 

A proper analysis of this process involves the theory of adiabatic invariants, with careful at- 
tention paid to the phase space separatrix that is present during the interval r/4 < t < 3r/4, 
when U(q) has a local maximum at q = [T3J, [T3] - However, the essence of what occurs should be 



s 



intuitively clear from the above discussion. A useful analogy is provided by imagining a container 
initially filled with three layers of a viscous, incompressible fluid, labeled I, II and III in verti- 
cally ascending order. Two syringes are attached to the bottom of the container. First one syringe 
extracts the lowest layer I of the fluid, bringing layer 77 to the bottom of the container. Next, the 
other syringe extracts layer //. Then the fluid layers are re-injected in the same order in which 
they were removed, resulting in the rearrangement of these layers. 

The incompressibility of the fluid in this analogy corresponds to Liouville's theorem: phase space 
volume is preserved under Hamiltonian dynamics. To justify quantitatively our assertion that the 
protocol X c (t) swaps regions / and II, we must show that the phase space volumes corresponding 
to the darkly shaded regions in Figs. 3(a)| and Figs. |3(d) are equal (in other words, it is precisely 
the trajectories in set I that get trapped in the right well), and similarly that the phase space 
volumes of the lightly shaded regions in Figs. 3(a) and Figs. 3(d)| are equal. 

Let Q(E; A) denote the volume of phase space enclosed by the surface H (z; A) = E: 



Q(E;X) 



dz 9 
dq 

E>U 



E-H(z;X) 



E-U(q;X) 



(10) 



where we have integrated over momentum to get to the second line. When either E = or A = 
the remaining integral can be evaluated analytically: 

-E 1 / 4 



dq ^/8{E - q A ) = V8E 3 / 4 I 

ft(0;A) 



-VX, 



dq^-W{q;\) 



3/2 ,3/2 



A7 +A 



n L + n R 



with Iq given by Eq. [7j The quantity 



A 



3/2 



(11a) 
(lib) 

(12) 



is the volume of phase space for which H < and q < 0, and f2/j(A#) is defined similarly for H < 
and q > 0. 



Using Eq. 11a, the phase space volumes of regions / and II, defined by Eq. [8j are 

fi J = V / 8E 1 3/4 J , n n = V8 (E 3 2 /A - E 3 ^ J (13) 



In Fig. |3(d) the lightly and darkly shaded regions correspond to phase space volumes ^l(A) and 



fi^(A), respectively, which are equal in value: 

n L (A) = n R (A) 



A 3/2 



(14) 
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Combining these results with Eq. [6] we find that 



n n = n L (A) 



(15) 



This establishes that our qualitative description of what occurs during this process, as illustrated in 
Fig. [3j is indeed consistent with the preservation of phase space volume, as mandated by Liouville's 
theorem. 

The picture developed in the preceding paragraphs suggests the following relationship between 
the initial (Ei) and final (Ef) energy of the system, in the limit r -4 oo: 



n(E f ;0) =n(Ei;0) + n n 
n(E f ;6) = n(Ei;0) - Oj 
n(E f ;6) = n(Ei;Q) 



if < E { < Ei 
if Ei < Ei < E 2 
if E 2 < Ei 



(16a) 
(16b) 
(16c) 



with Ei and E 2 = 2 4 / 3 I?i determined by the value of A (Eq. pi). Combining these results with 



Eq. 13 (note that = flu) we obtain 



E f 



(e^ + eI") 



3/4\ 4 /3 



e: 



3/4 



<) 4/3 



if < Ei < Ei 
if Ei<Ei< E 2 
if E 2 < Ei 



(17) 



As a test of Eq. 17 we sampled 10 5 initial conditions zq = (qo,po) from a microcanonical 
ensemble at energy Ei = Hq(zo) = 2.8, near the bottom of region 7Y (see Fig. [3]). For each initial 
condition zq we generated a trajectory zt by integrating Hamilton's equations as the parameters 



were varied as in Fig. 2(a), with r = 12000. The resulting distribution of final energies Ef = 
Hq(z t ), spanning a range from .E^mm = 0.0030 to -E/, m ax = 0.0150, was characterized by a mean 
value Ef = 0.0106 and a standard deviation ge s = 0.0014, in excellent agreement with the value 

(The small discrepancies reflect the fact that the duration 

some 



Ef = 0.0104 predicted by Eq. 



17 



r = 12000 is finite.) While these numerical results support the analysis leading to Eq. 17 



caveats are in order. In particular, Liouville's theorem itself rules out the possibility that all initial 
conditions with energy Ei = 2.8 lead to a net decrease of energy, Ef < Ei. We defer a discussion 
of this issue to Sec. IIVL 



To this point we have considered a symmetric protocol, Fig. 2(a), in which each well reaches 



the same maximal depth, determined by the value of A (Fig. |3(d)j ). However, the analysis is easily 



generalized to the asymmetric protocol shown in Fig. 2(b), in which the parameters are varied 
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U(q)r 



(a) t = 




(b) t = t/2 



o 
q 



(c) t = T 



FIG. 4. Similar to Fig. [3] but for an asymmetric protocol, Fig. |2(b)| with Ax, = 5.0 and A/j = 3.0. The phase 
space volume of set / (the darkly shaded region) remains constant, as does the volume of set II (lightly 
shaded), but the two volumes differ: Qi ^ fljj. 



around a rectangle with corners at (0,0) and (Al,Ar). Regions I, II and III are denned as in 
Eq. |8j but now the energies E\ and E 2 are defined by 

k 3/2 \ 4 /3 / 3/2 A 3/2\ 4 /3 

• E 2 (A)= { Ar + l 1 



■Ei (A) 



Ai 

3/n 



37, 



o 



(18) 



When the protocol is implemented quasi-statically, the net result is a rearrangement of sets I and 
II, as depicted in Fig. |4} Eq. [16] now leads to the result 



E f 



e\i* + eT 



eTY 



E 



3/4 



E; 



if < < Si 
if E 1 <E t < E 2 
if E 2 < Ei 



(19) 



The viscous fluid analogy also applies to this situation, only now the syringes remove different 
quantities of fluid, Qi ^ tin. Alternatively, the processes illustrated in Figs. [3] and [4] are analogous 
to a simple shuffle of a deck of cards, in which a stack of adjacent cards (region //) is removed 
from the middle of the deck and transferred to the bottom. 

It should now be clear how to design a quasi-static protocol that lowers the energy of the system 
almost to zero, for a given initial energy Ei = Ho(zq). Namely, we choose Aft such that Ei is slightly 
above E\, thus locating the initial conditions near the bottom of region 7T. If we then implement 
the protocol shown in Fig. [2j in either its symmetric (Ax, = Ar = A) or asymmetric (Ax ^ A/j) 
version, the system will be trapped near the bottom of the left well at t = t/2, and will end the 
process with Ef 0. This outcome is independent of the value of Ax, which simply determines 
the width (in energy) of region iT. 
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III. EXORCISING MAXWELL'S DEMON 

Let us now return to the perpetual-motion device of the second kind proposed in the Introduc- 
tion: after equilibrating the system with a thermal reservoir at temperature T (step 1), we measure 
the initial energy Ei (step 2), then choose a protocol that reduces the energy near to zero (step 
3) . The amount of work we extract during this cycle - equivalently, minus the amount of work we 
perform on the system - is given by 

Extracted =~W = E i -E f <E i (20) 

If we repeat this process many times, then the average work extracted per cycle satisfies 

(Extracted) < <^> = / dz / q (z ) H (z ) = | /T 1 (/T 1 = k B T) (21) 

where the canonical distribution p eq oc exp(— /3Hq) reflects initial equilibration with the reservoirj^] 
To approach this upper bound of (3/4)k B T per cycle, in which the thermal energy of the system 
is entirely converted to work (Ej = 0), the initial energy must be measured with high precision, 



allowing us to choose a protocol for which E% — E\(A) is tiny but positive (Eqs. 17, 19). However, 
as mentioned in the Introduction, these measurements generate information that must ultimately 
be erased, at a cost of /3 -1 ln2 per bit. There is a competition at play here: increased precision 
brings us closer to the maximal extracted work, but carries the penalty of increased accumulation 
of information. 

To address this issue, imagine a measurement apparatus that reports the initial energy of the 
system with finite precision. Specifically, given the initial microstate zq, the apparatus outputs one 
of K values associated with specified energy intervals A, B, C, ■ ■ ■ . Taking K = 4 for purpose of 
illustration, the apparatus outputs A, B, C, or D according to 

A : < H {z ) < E A 
B :E A < tf (z ) < E B 

(22) 

C :E B < H (z ) < E c 
D :E C < H (z ) 

where the values E A , E B , and Eq are fixed properties of the apparatus. 

Now consider the following strategy for choosing a cyclic protocol, based on the output of the 
measurement apparatus. 



In Eq. 21 we have used the identity (E) = -(d/dp)\nZ, with Z = J dz exp(-/3H ) = v / 8^r(5/4)/3" 3/4 . 
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Output = A: Do nothing to the system, as it is already in the lowest-energy interval. 



• Output = B: Using Eq. 18, set -Ei(A) = Ea and £2 (A) = Eb, that is choose (Al,Ar) so that 
interval B in Eq. 22 corresponds to region II in Eq. |8j Next, implement the asymmetric protocol 
of Fig. |2(b)[ under which initial conditions from this region are transferred to the bottom of the 
potential well, as in Fig. [4] 

• Output = C: Set Ei(A) = Eb and #2 (A) = Eq, then implement the asymmetric protocol. 
Again, the energy interval containing the initial conditions - interval C, in this case - is shuffled 
to the bottom of the potential. 

• Output = D: Set E\(A) = Eq and ^(A) = E* , where E* > Eq is an arbitrary cutoff energy, 
then implement the asymmetric protocol. In this case, initial conditions from the region between 
Eq and E* are transferred to the bottom of the potential, whereas if H(zq) > E* the protocol 
produces no net change in the energy of the system. 

This strategy takes advantage of the limited knowledge provided by the measurement of the 
initial energy. When it is implemented, the energy of the system decreases (that is, Ef < Ef) if 
Ea < Ei < E* , and remains unchanged otherwise. Thus, on average per cycle, work is extracted 
from the system, 

(W extract cd) > (23) 

and ultimately from the reservoir that replenishes the system's energy. 

Over N S> 1 repetitions of the process, the measurement apparatus generates a symbolic string 
of length N, of the form BDCCADA ■ ■ ■ . Letting Px denote the probability of outcome X G 
{A, B, C, D} in a given measurement, the number of bits required to encode this string is given by 

N bits = NH/ln2, (24) 

where 

U = -Y, p x^P x (25) 
x 

is the Shannon entropy of the measurement [IS]. Now, both (Wextractcd) and % depend on Ea, Eb 
and Eq-, and the former also depends on E* . In the following section we establish that, no matter 
what values these parameters take, the inequality 

(^ extracted ) < p- l U (26) 
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is satisfied. The extraction of work thus comes at the cost of the accumulation of information: on 
average, at least one bit is written per (3~ 1 In 2 of extracted work. |^] 

We now turn our attention to the eventual cost of erasing this information. By Landauer's prin- 
ciple, the average work required to erase one bit of information is no less than /3 _1 In 2. Therefore, 



since the number of bits generated per cycle is %/ln2 (Eq. 24), the average work required to erase 



the information accumulated in one cycle of operation satisfies 

(Erasure) > (27) 



Combining Eqs. 26 and 27 we find that the work required to erase the accumulated information 



exceeds - or at best, matches - the work extracted during the cycle: 

(W extra cted) < < (Erasure) (28) 



Thus our model obeys the Kelvin-Planck statement of the second law, as it had better do! Eq. 28 
highlights the two logically distinct steps we take in reconciling our model with the second law. 
Although the second half of this inequality chain (that is, Landauer's principle) is derived by appeal 
to the second law itself [10], the first half (Eq. [26| is obtained without assuming the second law: in 
Sec. Ill A we do not infer Eq. 26 by arguing that the second law demands it, rather we will derive 



this inequality directly. 



Eq. 26 is a special case of an inequality recently derived by Sagawa and Ueda (see Eq. 3 
of Ref. |16j or, in the quantum setting, Eq. 14 of Ref. [T7])> which generalizes the second law 
of thermodynamics to processes with feedback, such as the one considered in this paper. This 
inequality also follows readily from recent generalizations [161 [TH1 IT§] of the nonequilibrium work 
relation [2] and Crooks's fluctuation theorem [20] to nonequilibrium processes with feedback. In 
the following derivation, we do not directly invoke these results, instead we provide a self-contained 
analysis that is pertinent to our particular model. 

A. Bound on work 



Consider a cyclic process with the measurement apparatus described by Eq. 22 above. For initial 
conditions Zq, let z^(zq) denote the final conditions, after implementation of the cyclic protocol 



3 In the original Szilard engine, which involves a single particle in a chamber, this relationship is straightforward: 
the determination whether the particle is in the left or right half of the chamber produces exactly one bit of 
information, H = In 2, and standard thermodynamics gives the amount of work extracted during the subsequent 
isothermal expansion, Wextracted = P~ In 2. 
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corresponding to measurement outcome X £ {^4, B, C, D}. The work performed on the system as 
it evolves from zq to z^(zo) is given by 

W = H (z? (z )) - H (z ) (29) 

Over many repetitions of the process, with the protocol X determined by the measurement of 
initial energy, the average work performed on the system is 

A,B,C,D 



(30) 



where p eq oc exp(— /3Hq), and j Zo&x indicates integration over all microstates zq that result in the 
measurement outcome X. Eq. [30] can be rewritten as 

(dropping the subscript 0). Let us now define two functions 

p e %z)/P x ifzGX 
fx(z) = I (32) 

if z £ X 

g x (z)=p^(z)) (33) 

where Px = J z ^x P eq ( z ) ^ s ^ e probability that the outcome of the measurement is X. We can 
interpret fx( z ) as the probability distribution of initial microstates, conditioned on the outcome 
X. Moreover, f dzgx(z) = f dz^p cq (z^) = 1 (since phase volume is preserved, dz = dz^(z), by 
Liouville's theorem), therefore gx( z ) can also be interpreted as a probability distribution on phase 
space. 

With these definitions, Eq. 31 becomes 

(W)=r 1 E / dzP x / x (z)ln Px/ f (z) (34) 

= P' 1 E p * I dz /^( z ) ln f -^r\ + I 3 ' 1 E p * ln p * ( 35 ) 
x J gx[z) x 

The integral appearing in Eq. [35] is the relative entropy or Kullback-Leibler divergence between the 
distributions fx( z ) and gx( z )', this quantity is equal to zero if the two distributions are identical 
and is positive otherwise [15]: 



/ 



fxln^ = D[f x \\gx]>0 (36) 

gx 
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Thus the first sum on the right side of Eq. 35 is non-negative, hence 



(W) >f3- 1 Y,PxlnPx = -r 1 U (37) 



x 

which is equivalent to Eq. 26 the bound we set out to establish. 

The above derivation hinges on the non-negativity of relative entropy. A similar approach has 
recently been taken to obtain inequalities related to the second law of thermodynamics |21H24| . 
in situations when the system of interest does not necessarily begin (or end) in states of thermal 
equilibrium. (See also Ref. [23] for an alternative derivation of such inequalities.) 

While the calculation presented here assumes a measurement apparatus with four possible 
outcomes, it should be clear that the analysis generalizes to any finite number of energy intervals. 
In fact, we can even drop the assumption that the measurement is strictly correlated with energy. 
That is, suppose phase space is divided into N regions (not necessarily corresponding to energy 
intervals) and suppose that when the system is in microstate z, the measurement apparatus returns 
a value X that identifies the region of phase space to which that microstate belongs. Finally, a 
cyclic protocol is assigned to each possible outcome. It can be verified by the reader that the steps 



leading to Eq. [37| (equivalently Eq. 26 ) remain valid. 

Moreover, to this point we have considered a measurement apparatus that is error-free: if the 
initial microstate zo belongs in region X, then the measurement outcome is necessarily X. Let 
us now consider a more general situation in which P(X\zq) represents the probability that the 
apparatus outputs the value X, when a measurement is performed on a system in microstate zq. 
In the Appendix we analyze this scenario and derive the bound 

(TV extracted ) < prH (38) 

where X is the mutual information [TS] between the variable zq and X. For error- free measurements 



(e.g. Eq. 22), X = % and Eq. 38 reduces to Eq. 37 When the apparatus is capable of making 



errors, then X < % [15], which conforms nicely to the intuition that an error- prone measuring 



device degrades our ability to extract work from the system. In either case Eq. 26 remains valid. 

Finally, we note that the results derived in this section can be generalized to systems evolving 
according to stochastic equations of motion [26] . 



4 In fact, as long as our measurement apparatus has more than one possible outcome X, this result will be a strict 
inequality, since /x(z) = 7^ grx(z) for any z ^ X, hence -D[/x||ffx] > 0. 
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IV. DISCUSSION AND CONCLUSIONS 

The past few years have seen considerable interest in the thermodynamics of small systems 
and in the applicability of the second law to various nanoscale scenarios (see Ref. [27] for a recent 
review), including those involving feedback. Motivated by the recent work of Marathe and Par- 
rondo [6], we have studied a model single-particle system that is "cooled" under the quasi-static 
cycling of external parameters, when initial conditions are sampled microcanonically. We have used 
this model to construct a procedure for systematically harvesting energy from a thermal reservoir 
and converting that energy to work, in seeming violation of the Kelvin-Planck statement of the sec- 
ond law. This procedure, however, involves the repeated measurement of the energy of the system. 



Modeling the measurement apparatus in Sec. Ill, we have shown by explicit calculation that the 



average work delivered per operating cycle does not exceed the average work that must eventually 
be expended (in accordance with Landauer's principle) to erase the information acquired in the act 
of measuring the initial energy. Thus on balance the Kelvin-Planck statement remains satisifed. 

Our model illustrates the idea - which traces back to Maxwell and Szilard - that knowledge 
about the microscopic state of a system can be exploited to circumvent the second law of ther- 



modynamics, loosely speaking [8]. In this setting, Eq. 37 places a bound on the work that can be 
extracted during a cyclic process, following a measurement that provides information about the 
initial state of the system. As already mentioned, similar bounds have been obtained and studied 
in the past few years, both for quantum systems |17l [28Tl30| and for systems evolving according 



to stochastic equations of motion [TBI HHJ HHJ I3TH37] . We also note that Eq. 35, a precursor to 



Eq. 37, generalizes the relative entropy work relation of Kawai, Parrondo and Van den Broeck [38J 
to processes with feedback. 

Let us now return to a point mentioned in Sec. [TTJ the apparent incompatibility of Eq. [i~7| with 
Liouville's theorem. Consider a single energy shell, that is the set of all points zq with a particular 
value of energy E{ = Hq{zq). This set, which we denote Si, has the topology of a simple closed loop 
in phase space. Let us assume that this energy shell is located in region II, hence E\ < Ei < E<±. 



If we evolve trajectories from initial conditions in Si, using the protocol in Fig. 2(a) , we arrive at 
a set of final conditions, Sf, which also has the topology of a simple closed loop: 

Si = {z \H (z ) = Ei} -> S f = {z T (z )\Ho(z ) = E l } (39) 



By Liouville's theorem, these loops enclose equal volumes of phase space: fi[<S/] = fi[<Si]. This, 
however, is incompatible with a literal interpretation of Eq. 17, which seems to assert that every 
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initial condition with energy E{ leads to a net decrease of energy, Ef < Ei, in other words that 
Sf is contained entirely in the interior of Si. To address this apparent contradiction, we sketch a 



more careful interpretation of Eq. 17 



For any finite duration r, there exist some initial conditions zq G Si that yield trajectories 
for which the system's energy increases: Hq(z t {zq)) > Ei. We will refer to these trajectories as 
"bad actors" , as they spoil the picture shown in Fig. [3| [^] While bad actors exist for any finite 
r, the probability to generate one of these trajectories generally decreases with increasing r, for 
initial conditions sampled microcanonically from Si. We have observed this trend in numerical 
simulations over a range from r = 1200 to 2000 (data not shown); and as mentioned in Sec. [TlJ 
for Ei = 2.8 and r = 12000 no bad actors were observed among 10 5 trajectories. Thus for large 
but finite r, we expect Sf to be a highly convoluted, closed loop - necessarily enclosing the same 
volume of phase space as Si - with much of the loop concentrated at low energies near the value 



predicted by Eq. 17, but with tendrils reaching into the region of energies higher than than Ei. We 
believe this issue deserves a more careful treatment, but this is beyond the scope of the present 
paper. We end with a conjecture regarding the quasi-static limit: 

lim P \H (z T ) -E f \<~ =1 for any e > (40) 

where the quantity inside the limit is the probability to generate a trajectory whose final energy falls 



within an interval of width e around the value predicted by Eq. 17, and microcanonical sampling 
at energy Ei is assumed. We believe this conjecture represents the proper way to understand the 
validity of Eq. 17 and Fig. [3j Similar comments apply to Eq. 19 and Fig. |4j 
Our results suggest several avenues for future research. 

First, it would be interesting to explore a quantum-mechanical version of our model system. 
Here, the possibility of tunneling between the left and right wells introduces a new aspect to the 
problem, possibly spoiling the picture developed in Sec. [XT] by preventing particles from getting 
trapped. 

Because the protocols discussed in Sec. [TT| involve the quasi-static cycling of external parameters, 
it is natural to wonder whether the swapping of regions / and II (illustrated in Fig. [3]) can be 
described in terms of a geometric phase. 



Finally, we have not explicitly modeled the "demon" in Sec. Ill Instead, we have assumed 
the existence of some mechanism by which a particular outcome of the measurement leads to the 



In simulations, we have observed bad actors that begin near the bottom of region II, but get trapped in the right 
well at the end of the first stage of the process, e.g. just before t = r/4 in Fig. [3] As a result, they do not get 
drawn into the left well during the second stage. They subsequently "float" on top of the darkly shaded set I in 
Fig. |3j and end the process with Hq(z t (zq)) ~ E^- 



18 



implementation of the corresponding protocol. It would be interesting, however, to model this 
mechanism explicitly within a Hamiltonian framework, either by introducing additional degrees of 
freedom to model the demon or by specifying coupling terms between the measurement device and 
the system. In this case, we anticipate that the bound on extracted work will be given in terms 
of the correlation between the state of the system and the state of the measuring device and/or 
demon [251 ES ED E3] ■ 
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Appendix A: Analysis of error-prone measurement devices 

Consider a measurement apparatus with a discrete set of possible outputs, X = A, B, C, ■ ■ ■ , 
and let P(X\zq) denote the probability to obtain outcome X, when the measurement is performed 
on a system in microstate zq. We assume that every measurement produces some outcome, hence 
P(-^\ z o) = 1 f° r an y z o- As before, a cyclic protocol is chosen based on the outcome of the 
measurement. For initial conditions zq, let z^(zq) denote the final conditions, after implementation 
of the protocol corresponding to outcome X. The work performed on the system is given by Eq.|29[ 
and averaging over many repetitions of the process gives us 

(W) = f dz p cq (z ) P(X\z ) (flb(z*(z )) " H (z )) (Al) 



where P(zq,X) is the joint probability that the system is initially in microstate zq and the mea- 
surement outcome is X. Dropping the subscript 0, we now introduce two probability distributions 



(compare with Eqs. 32, 33) 



f x (z) ee P(z\X) = P(z, X)/P X (A3) 
2x(z) ^P cq (z*(z)) (A4) 
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where Px = J dz P(z, X) is the net probability to generate the outcome X, and P(z\X) denotes 
the conditional probability distribution that the initial microstate is z, given the measurement 
outcome X. In terms of these distributions we now have 

7x(z) PxP cq (z) 



(W)=p- 1 Y^ [ dzP(z,X)-. , ,. . , 

I9x{i) Pxfx{z) 

^/T^x/dz/^m^-r'E/dzPCz^ln P(Z ' X) 



> oq (z)Px 



(A5) 
(A6) 



On the last line, the first term is a relative entropy, and therefore non-negative; while the second 
term (apart from the factor /3 _1 ) is the mutual information between z and X. We thus arrive at 



(W) > -p~ x T 



(A7) 



equivalently Eq. 38 
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